Pass-Fail Testing: Statistical Requirements and Interpretations
Abstract
Performance standards for detector systems often include requirements for probability of detection and probability of false alarm at a specified level of statistical confidence. This paper reviews the accepted definitions of confidence level and of critical value. It describes the testing requirements for establishing either of these probabilities at a desired confidence level. These requirements are computable in terms of functions that are readily available in statistical software packages and general spreadsheet applications. The statistical interpretations of the critical values are discussed. A table is included for illustration, and a plot is presented showing the minimum required numbers of pass-fail tests. The results given here are applicable to one-sided testing of any system with performance characteristics conforming to a binomial distribution.
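As a rough illustration of how the minimum number of pass-fail tests can be computed from a binomial cumulative distribution, here is a minimal sketch of the standard one-sided binomial demonstration test. It is not taken from the paper itself: the function names `binom_cdf`, `min_trials`, and `zero_failure_trials` are hypothetical, and the sketch assumes independent trials with a constant success probability. The same calculation would apply to a false-alarm requirement if "success" is read as "no false alarm".

```python
from math import ceil, comb, log


def binom_cdf(k, n, q):
    """P(X <= k) for X ~ Binomial(n, q), summed directly from the pmf."""
    return sum(comb(n, i) * q**i * (1 - q) ** (n - i) for i in range(k + 1))


def min_trials(p_required, confidence, max_failures=0):
    """Smallest n for which at most `max_failures` failures in n trials
    demonstrates a success probability >= p_required at the given
    one-sided confidence level (illustrative helper, not from the paper).

    The test is passed only if a system sitting exactly at the required
    probability would produce so few failures with probability
    <= 1 - confidence.
    """
    q = 1 - p_required            # failure probability at the boundary
    n = max_failures + 1
    while binom_cdf(max_failures, n, q) > 1 - confidence:
        n += 1
    return n


def zero_failure_trials(p_required, confidence):
    """Closed form for the zero-failure case: p^n <= 1 - confidence."""
    return ceil(log(1 - confidence) / log(p_required))


if __name__ == "__main__":
    print(min_trials(0.90, 0.95))      # 29 consecutive passes
    print(min_trials(0.95, 0.95))      # 59 consecutive passes
    print(min_trials(0.90, 0.95, 1))   # 46 trials if one failure is allowed
```

In a spreadsheet or statistics package, the loop in `min_trials` would typically be replaced by a built-in binomial or beta distribution function; the explicit sum is used here only to keep the sketch dependency-free.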
Similar resources
Bioequivalence Approach for Whole Effluent Toxicity Testing
Increased use of whole effluent toxicity (WET) tests in the regulatory arena has brought increased concern over the statistical analysis of WET test data and the determination of toxicity. One concern is the issue of statistical power. A number of WET tests may pass the current hypothesis test approach because they lack statistical power to detect relevant toxic effects because of large within-...
Evaluation of Statistical Outlier Rejection Methods for IDDQ Testing
The quiescent current testing (IDDQ testing) for CMOS ICs provides several advantages over other testing methods. However, the future of IDDQ testing is threatened by increased sub-threshold leakage current for new technologies. The conventional pass/fail limit setting methodology cannot survive in its present form. In this paper we evaluate two statistical outlier rejection methods – the Chauv...
Measuring Hospital Performance Using Mortality Rates: An Alternative to the RAMR
Background The risk-adjusted mortality rate (RAMR) is used widely by healthcare agencies to evaluate hospital performance. The RAMR is insensitive to case volume and requires a confidence interval for proper interpretation, which results in a hypothesis testing framework. Unfamiliarity with hypothesis testing can lead to erroneous interpretations by the public and other stakeholders. We argue t...
The reliability of the pass/fail decision for assessments comprised of multiple components
OBJECTIVE The decision having the most serious consequences for a student taking an assessment is the one to pass or fail that student. For this reason, the reliability of the pass/fail decision must be determined for high quality assessments, just as the measurement reliability of the point values. Assessments in a particular subject (graded course credit) are often composed of multiple compon...
Translation Evaluation in Educational Settings for Training Purposes
The following article describes different methods and techniques used in educational settings for translation evaluation. Translation evaluation is the placing of value on a translation i.e. awarding a mark, even if only a binary pass/fail one. In the present study, different features of the texts chosen for evaluation were firstly considered and then scoring the t...